Finding Differentially Covarying Needles in a Temporally Evolving Haystack: A Scan Statistics Perspective
Authors
Abstract
Recent results in coupled or temporal graphical models offer schemes for estimating the relationship structure between features when the data come from related (but distinct) longitudinal sources. A novel application of these ideas is analyzing group-level differences, i.e., identifying whether trends of estimated objects (e.g., covariance or precision matrices) differ across disparate conditions (e.g., gender or disease). Often, poor effect sizes make detecting the differential signal over the full set of features difficult: for example, dependencies between only a subset of features may manifest differently across groups. In this work, we first give a parametric model for estimating trends in the space of SPD matrices as a function of one or more covariates. We then generalize scan statistics to graph structures, to search over distinct subsets of features (graph partitions) whose temporal dependency structure may show statistically significant group-wise differences. We theoretically analyze the Family-Wise Error Rate (FWER) and bounds on Type 1 and Type 2 error. On a cohort of individuals with risk factors for Alzheimer’s disease (but otherwise cognitively healthy), we find scientifically interesting group differences where the default analysis, i.e., models estimated on the full graph, does not survive reasonable significance thresholds.
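The idea of scanning feature subsets for group-wise covariance differences while controlling the FWER can be illustrated with a small sketch. This is not the paper's method: it ignores the temporal trend model on SPD matrices, and it swaps in a generic max-statistic permutation test over a user-supplied list of candidate subsets. The function names `subset_stat` and `scan_max_permutation`, the Frobenius-norm statistic, and the parameters `n_perm` and `seed` are all assumptions made for illustration.

```python
import numpy as np


def subset_stat(x_a, x_b, subset):
    """Frobenius norm of the difference between the two groups'
    sample covariance matrices, restricted to one feature subset.
    (Illustrative statistic, not the one used in the paper.)"""
    idx = np.asarray(subset)
    cov_a = np.cov(x_a[:, idx], rowvar=False)
    cov_b = np.cov(x_b[:, idx], rowvar=False)
    # atleast_2d handles single-feature subsets, where np.cov returns a scalar
    return float(np.linalg.norm(np.atleast_2d(cov_a - cov_b), ord="fro"))


def scan_max_permutation(x_a, x_b, subsets, n_perm=200, seed=0):
    """Scan all candidate subsets and return (observed statistics,
    FWER-adjusted p-values) using a max-statistic permutation null.

    x_a, x_b : arrays of shape (n_samples, n_features), one per group.
    subsets  : list of lists of feature indices (hypothetical partitions).
    """
    rng = np.random.default_rng(seed)
    observed = np.array([subset_stat(x_a, x_b, s) for s in subsets])

    pooled = np.vstack([x_a, x_b])
    n_a = x_a.shape[0]
    max_null = np.empty(n_perm)
    for p in range(n_perm):
        # Shuffle group labels, then record the largest statistic over subsets
        perm = rng.permutation(pooled.shape[0])
        perm_a, perm_b = pooled[perm[:n_a]], pooled[perm[n_a:]]
        max_null[p] = max(subset_stat(perm_a, perm_b, s) for s in subsets)

    # Comparing each subset to the null distribution of the maximum
    # accounts for having scanned every subset.
    exceed = (max_null[None, :] >= observed[:, None]).sum(axis=1)
    p_adj = (1 + exceed) / (n_perm + 1)
    return observed, p_adj


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    group_a = rng.normal(size=(80, 6))
    group_b = rng.normal(size=(80, 6))
    # Inject a dependency between features 0 and 1 in group B only
    group_b[:, 1] = 0.9 * group_b[:, 0] + 0.1 * group_b[:, 1]
    candidates = [[0, 1], [2, 3], [4, 5], [0, 1, 2, 3, 4, 5]]
    stats, p_values = scan_max_permutation(group_a, group_b, candidates)
    for s, t, pv in zip(candidates, stats, p_values):
        print(s, round(t, 3), round(pv, 3))
```

Taking the maximum over subsets inside each permutation is what ties the adjusted p-values to the family-wise error rate; a per-subset null would not correct for the search itself.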
Similar resources
The Needles-in-Haystack Problem
We consider a new data mining problem of detecting the members of a rare class of data, the needles, that have been hidden in a set of records, the haystack. Besides the haystack, a single instance of a needle is given. It is assumed that members of the needle class are similar according to an unknown needle characterization. The goal is to find the needle records hidden in the haystack. This p...
Process rather than pattern: finding pine needles in the coevolutionary haystack
The geographic mosaic theory is fast becoming a unifying framework for coevolutionary studies. A recent experimental study of interactions between pines and mycorrhizal fungi in BMC Biology is the first to rigorously test geographical selection mosaics, one of the cornerstones of the theory.
The haystack is made of needles.
Developing genetic tests that have clinical utility and validated biomarkers presents many challenges. Much has been written about these challenges for the development of genetic test evidence (Khoury et al., 2010; Horn and Terry, 2012) and biomarker validation (Lesko and Atkinson, 2001; Surh, 2009). One consistent thread through these challenges is the lack of well-characterized cohorts. This ...
Guest Editors' Introduction: Information Discovery--Needles and Haystacks
For thousands of years, people have realized the importance of archiving and finding information. With the advent of computers, it became possible to store large amounts of information in electronic form — and finding useful needles in the resulting haystacks has since become one of the most important problems in information management. Many systems exist to help users navigate the considerable...
Finding the epistasis needles in the genome-wide haystack.
Genome-wide association studies (GWAS) have dominated the field of human genetics for the past 10 years. This study design allows for an unbiased, dense exploration of the genome and provides researchers with a vast array of SNPs to look for association with their trait or disease of interest. GWAS has been referred to as finding needles in a haystack and while many of these "needles," or SNPs ...
Journal title:
- CoRR
Volume: abs/1711.07575
Publication date: 2017